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(2) INFORMATION FOR SEQUENCE ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300 

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350 

CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 
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GAAATTAGTG GTTTGGATGA GAACTACACT 
GTGCCAGGTC ATGGAGCCCA ACCAGAACAA 
TTTCTAAAGG CAACGCACAA AGGATTTTTG 
AGGGATTGTA ATAGTCTTCC CGGAGTCCTG 
TAATTTGTAC TATTATGAAA CAGACTACGA 
AAAACCTTTA TGTTAAAATA GACACCATTG 
CAAGGTGACC TTGGTGAAAG AAAGATGAAG 
GATTGGACCT TTGTCCAAAA AGGGATTCTA 
GGGCTTGCAT AGCATTGGTT TCTGTCAAAG 
ACCATTGTTG AGAACTTAGC TGTCTTTCCA 
ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC 
AAGAGGCAGA AAATTCCCCC AGAATGCATT 
CTAGTACCCA TTGGAAAATG CATCTGCAAA 
GGACACTTGC GAACCCTGTG GCCGCAGGTT 
ATCTCCAGTG TTCTCGTTGT CCAACCCACA 
TCATCCAGGT GTGAATGTGA AGATGGGTAC 
ACCATACGTT GCATGCACGA GGCCTCCCTC 
TCAATATCAA TCAAACGACT GTAAGTTTGG 
AACGGGGGAA GAAACGATGT CACCTACAGA 
TTGGGAACAG GGAGAATGTG TGCCATGCGG 
CCCAGCAGAC GGGATTAGAG GATAACTATG 
GCCCATGCAA ATTACACTTT CGAAGTTGAA 
CTTAAGCAGA TCCCAGAGGC TCTTCGCTGC 
AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA 
CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG 
CACGGAATAT GAAATCAAGT ATTATGAGAA 
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CCGATAAGAA 


CATACCAGGT 


450 


CTGGCTGCGG 


ACTAACTGGA 


500 


TAGAATTGAA 


ATTCACCTTG 


550 


GGAACTTGCA 


AGGAAACGTT 


600 


CACCGGCAGG 


AATATACGAG 


650 


CTGCAGATGA 


AAGTTTCACA 


700 


CTGAACACTG 


AGGTGAGAGA 


750 


TCTTGCCTTT 


CAGGATGTAG 


800 


TGTACTACAA 


GAAGTGCTGG 


850 


GATACAGTGA 


CTGGTTCGGA 


900 


ATGTGTCAGC 


AGTGCCGAGG 


950 


GCAGTGCAGA 


AGGAGAGTGG 


1000 


GCAGGCTATC 


AGCAAAAAGG 


1050 


CTACAAATCT 


TCCTCTCAGG 


1100 


GCTTCTCTGA 


CCGAGAAGGA 


1150 


TACAGAGCTC 


CTTCTGATCC 


1200 


TGCACCACAG 


AACCTTATTT 


1250 


AATGGAGTCC 


TCCGGCTGAC 


1300 


ATACTGTGTA 


AGCGGTGCAG 


1350 


AAGTAACATT 


GGATACATGC 


1400 


TCACTGTCAT 


GGACCTACTT 


1450 


GCTGTAAATG 


GAGTTTCGGA 


1500 


TGTTAGCATC 


ACCACCGGTC 


1550 


TGAAGGAGCG 


AGTACTGCAG 


1600 


GAGCATCCCA 


ATGGAGTCAT 


1650 


AGATCAACGG 


GAAAGGACGT 


1700 





• 




4 
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ACTCAACACT 


CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA 


1750 


CCGGGAACAG 


TGTACGTCTT TCAGATCCGG 


GCGGTCACTG 


CTGCCGGTTA 


1800 


TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 


1850 


GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 


1900 


ATAATTGCTG 


TAGTGGCTGT AGCAGGGACC 


ATCATCTTGG 


TGTTCATGGT 


1950 


GTTCGGCTTC 


ATCATTGGAA GAAGGCACTG 


TGGTTATAGC 


AAGGCTGACC 


2000 


AAGAAGGGGA 


TGAAGAACTC TACTTTCATT 


TTAAATTTCC 


AGGCACCAAA 


2050 


ACCTACATTG 


ACCCTGAAAC CTATGAGGAC 


CCAAATAGAG 


CTGTCCATCA 


2100 


ATTCGCCAAG 


GAGCTAGATG CCTCCTGTAT 


TAAAATTGAG 


CGTGTGATTG 


2150 


GTGCAGGAGA ATTTGGAGAA GTTTGCAGTG GTCGTTTGAA ACTTCCGGGC 


2200 


CAGAGAGATG 


TTGCAGTGGC CATAAAAACC 


CTGAAAGTTG 


GTTACACAGA 


2250 


AAAGCAAAGG 


AGGGACTTTT TATGCGAAGC 


AAGCATCATG 


GGGCAATTTG 


2300 


ACCACCCAAA 


TGTCGTCCAT TTGGAAGGGG 


TTGTTACAAG 


AGGGAAGCCT 


2350 


GTCATGATTG 


TGATAGAGTT CATGGAGAAT 


GGAGCCCTGG 


ATGCATTTCT 


2400 


CAGGAAACAC 


GATGGGCAGT TTACAGTCAT 


TCAGTTGGTA 


GGAATGTTGA 


2450 


GAGGTATTGC 


CGCTGGGATG CGATACTTGG 


CTGATATGGG 


ATACGTTCAC 


2500 


AGGGACCTTG 


CAGCGCGCAA CATCCTTGTC 


AACAGCAATC 


TTGTTTGTAA 


2550 


AGTGTCAGAT 


TTTGGCCTTT CCCGGGTTAT 


AGAGGATGAT 


CCCGAAGCTG 


2600 


TCTACACCAC 


GACTGGTGGA AAAATTCCAG 


TAAGGTGGAC 


TGCACCGGAA 


2650 


GCCATTCAAT 


ACCGGAAGTT CACCTCAGCC 


AGCGATGTGT 


GGAGCTATGG 


2700 


GATTGTCATG 


TGGGAAGTGA TGTCTTATGG 


AGAAAGACCT 


TACTGGGACA 


2750 


TGTGAAATCA 


AGATGTCATT AAAGCGATAG 


AAGAAGGTTA 


TCGTTTGCCG 


2800 


GCGCCCATGG 


ATTGCCCAGC TGGTCTTCAC 


CAGCTAATGC 


TGGATTGTTG 


2850 


GCAGAAAGAT 


CGGGCGGAAA GGCCAAAGTT 


TGAGCAGATA 


GTCGGAATTC 


2900 


TAGACAAAAT 


GATTCGAAAC CCAAGTAGTC 


TGAAAACACC 


CCTGGGAACT 


2950 


TGTAGTAGAC 


CCTTAAGCCC TCTTCTGGAC 


CAGAGCACTC 


CTGACTTCAC 


3000 
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TGCCTTCTGT TCAGTTGGAG AATGGTTGCA 
ATAAGGACAA CTTCACAGCA GCGGGTTACA 
AGGATGACTA TCGATGATGT GATGAGTTTA 
TCAAAAGAAG ATCATGAGCA GCATCCAGAC 
ATTTACACGG AACAGGCATC CAAGTGTGAC 
AGGCTTAAGA CTGCAGGAGA ACAGTTCTGG 
TGCTGCTAGA AGACAGTTGA TATACTGGGT 
GATTTTAGAA GCACCTCCAG ACTTGAACTC 
CAAAAAGGGA ATTTAGGATC CACCACTGGT 
CAATAAACAA AGTACTACCT GAAAAACATC 
ACCTCCTTTT TATCTTATAG ACTTTTTAAA 
GAAAGAATAT ATTTGTCAAA TAAAAATCAT 
TGAAATATTT TCCTTAAAAT ATGTGATTTC 
CATCTGTGTT TATTCTGCTT AAGGACTTTG 
CTTTGGACCT TTTTAGTGTT AAATTTATGA 
CTTTGAAGAC TCTCAAACTT AAAGGAAAGC 
GGATGGACTT TGTCCTTCAT GGCTTTGGTA 
TAAACCAGTG ATGTTTTCAT ATTGTTTGCT 
TTGCAAGTTG CCAAGAGCTC TGATATTTTT 
TGTAAAAATC AGATAACATA CTAACTTTTC 
AAGCAATAAT GATCCATAAA TACTATAAGG 
TAGAGTGATT TACTAGGCAG AATTTAATAA 
ATTTTAGGTT TATGTGTATA TGATAAAAGG 
CTGGTGAAAG CAAGACTGGA AGCGAAGCTC 
TCCGAGCACA TCAAGAGCTT CAGTCTTGTG 
ATAGTTGACC TATATTTTGT ATTCTTTCTT 
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AGCTATTAAA AfGGAAAGGT 3050 

ACTCACTCGA GTCAGTGGCC 3100 

GGGATCACAC TGGTTGGCCA 3150 

TATGCGGGCA CAAATGTTGC 3200 

ACATCGGCCT CCCTCAGATG 3250 

CCTTCAGTAT ACGCATAGAA 3300 

CCTTCCTACA AGAAAGAGAA 3350 

CTAAGTGCCA CCAGAATATA 3400 

GGCCAGGAAC ACAGCAGAGA 3450 

CCAACACCTT GAGCTCTCGA 3500 

AATGTACATA AAGAATTTAA 3550 

GATCTTATTG TTAAAATGAA 3600 

AGACTATTCT TTTCCAGAAC 3650 

TTTTAGAAAG TTATTTGTAG 3700 

CACGTTACTA CACTGGGAAC 3750 

AAAACTACGC ACATAGTCGA 3800 

TCCTGGCTGT GTCATTTTGT 3850 

GATTGGCAGG TAGTTCAAAA 3900 

TAACAGGATT TTTTTTTCTT 3950 

AATGAAAAAA AAAAAAAAAG 4000 

CACTTTTAAC AGATTGTTTA 4 050 

AAAAAAAAGA GAGATGTCAA 4100 

CTGAGCTTCG TCTGAAGATG 4150 

TCCAGCTTTG GCTAACCCAA 4200 

ACAGTAAGAA ATTTAGGAAC 4250 

GTTGAATGCA GTCCAAATAC 4300 



108 

AAAA 4304 

(2) INFORMATION FOR SEQUENCE ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 amino acids 

(B) TYPE: amino acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
1 5 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 

20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 , 

Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro lie Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp lie Ser Lys Gly Asn Ala Gin 

85 90 95 

Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 * 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu Tyr Val 
130 ' ' 135 140 

Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 
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Gly Glu Arg Lys 
Leu Ser 
He Ala 



Lys Lys 
180 



Val Glu 
210 

Ser Ser 
225 

Glu Ala 
Leu Val 



Leu Val 
195 

Asn Leu 
Leu Val 
Glu Asn 



Pro He 
260 



Met Lys Leu 
165 

Gly Phe Tyr 
Ser Val Lys 



Ala Val Phe 
215 

Glu Val Arg 
230 

Ser Pro Arg 
245 

Gly Lys Cys 



Asn Thr Glu 
170 

Leu Ala Phe 
185 

Val Tyr Tyr 
200 

Pro Asp Thr 
Gly Thr Cys 



Met. His Cys 
250 

He Cys Lys 
265 



Val Arg Glu 

Gin Asp Val 

Lys Lys Cys 
205 

Val Thr Gly 
220 

Val Ser Ser 
235 

Ser Ala Glu 
Ala Gly Tyr 



He Gly Pro 
175 

Gly Ala Cys 
190 

Trp Thr lie 
Ser Glu Phe 



Ala Glu Glu 
240 

Gly Glu Trp 
255 

Gin Gin Lys 
270 



Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 
275 280 285 

Gin Asp Leu Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
290 295 300 

Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
305 310 315 320 

Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 

325 330 335 

Asn Leu He Phe Asn He Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg lie Leu 
355 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 ^ 375 380 

Asn He Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 ' 390 395 400 



Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 
405 410 415 



Thr Val 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
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Ala Val Ser 
435 

is 

Val Met Lys 
450 

Glu Pro Glu 
465 

Tyr Glu Lys 



Ser Thr Ser 



Phe Gin He 
515 

Arg Leu Asp 
530 

Ala Thr Ala 
545 



420 

He Thr 



425 

Thr Gly Gin Ala 
440 



Glu Arg Val Leu Gin Arg 
455 

His Pro Asn Gly Val He 
470 



Asp Gin 
485 

Ala Ser 
500 

Arg Ala 
Val Ala 
Val Ser 



Arg Glu Arg Thr 



He Asn Asn Leu 
505 

Val Thr Ala Ala 
520 

Thr Leu Glu Glu 
535 

Ser Glu Gin Asn 
550 



Val Ala Val Ala 
He He 
Asp Glu 



Gly Arg 
580 



He Asp 
610 

Ala Lys 
625 

Ala Gly 
Gin Arg 
Glu Lys 



Glu Leu 
595 

Pro Glu 
Glu Leu 
Glu Phe 



Asp Val 
660 

Gin Arg 
675 



Gly Thr He 
565 

Arg His Cys 

Tyr Phe His 

Thr Tyr Glu 
615 

Asp Ala Ser 
630 

Gly Glu Val 
645 

Ala Val Ala 
Arg Asp Phe 



430 

Ala Pro Ser Gin Val Ser Gly 
445 

Ser Val Gin Leu Ser Trp Gin 
460 

Thr Glu Tyr Glu He Lys Tyr 
475 480 

Tyr Ser Thr Leu Lys Thr Lys 
490 495 

Lys Pro Gly Thr Val Tyr Val 

510 

Gly Tyr Gly Asn Tyr Ser Pro 
525 

Ala Ser Gly Lys Met Phe Glu 
540 

Pro Val He He He Ala Val 
555 560 

y 

lie Leu Val Phe Met Val Phe Gly Phe 
570 575 



Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
585 590 

Phe Lys Phe Pro Gly Thr Lys Thr Tyr 
600 * 605 

Asp Pro Asn Arg Ala Val His Gin Phe 

620 

Cys He Lys He Glu Arg Val He Gly 
635 640 

Cys Ser Gly Arg Leu Lys Leu Pro Gly 
650 655 

He Lys Thr Leu Lys Val Gly Tyr Thr 
665 670 

Leu Cys Glu Ala Ser He Met Gly Gin 
680 685 
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Phe Asp 
690 

Lys Pro 
705 

Ala Phe 

Gly Met 

Gly Tyr 

Asn Leu 
770 

Asp Asp 
785 

Arg Trp 
Ser Asp 
Gly Glu 



His Pro Asn Val 
Val Met 
Leu Arg 



He Val 
710 



Leu Arg 
740 

Val His 
755 

Val Cys 
Pro Glu 
Thr Ala 



Val Trp 
820 

Arg Pro 
835 



Lys His 
725 

Gly He 

Arg Asp 

Lys Val 

Ala Val 
790 

Pro Glu 
805 

Ser Tyr 
Tyr Trp 



Val His 
695 

He Glu 

Asp Gly 

Ala Ala 

Leu Ala 
760 

Ser Asp 
775 

Tyr Thr 
Ala He 
Gly He 



Asp Met 
840 



111 

Leu Glu 



Gly Val Val Thr Arg Gly 
700 



Phe Met Glu Asn Gly Ala 
715 



Gin Phe 
730 

Gly Met 
745 

Ala Arg 
Phe Gly 
Thr Thr 



Gin Tyr 
810 

Val Met 
825 



Thr Val He Gin 



Arg Tyr Leu Ala 
750 

Asn He Leu Val 
765 

Leu Ser Arg Val 
780 

Gly Gly Lys He 
795 

Arg Lys Phe Thr 



Trp Glu Val Met 
830 



Ser Asn Gin Asp Val He 

845 



Leu Asp 
720 

Leu Val 
735 

Asp Met 

Asn Ser 

He Glu 

Pro Val 
800 

Ser Ala 
815 

Ser Tyr 
Lys Ala 



He Glu 
850 

Leu His 
865 

Pro Lys 
Pro Ser 
Pro Leu 



Glu Gly 
Gin Leu 
Phe Glu 



Ser Leu 
900 

Leu Asp 
915 



Tyr Arg Leu 
855 

Met Leu Asp 
870 

Gin He Val 
885 

Lys Thr Pro 
Gin Ser Thr 



Pro Ala Pro Met 
Cys Trp 
Gly He 



Gin Lys 
875 



Leu Asp 
890 



Asp Cys Pro 
860 

Asp Arg Ala 
Lys Met He 



Leu Gly 
905 

Pro Asp 
920 



Ser Arg Pro 
910 



Ala Gly 



Glu Arg 
880 

Arg Asn 
895 

Leu Ser 



Gly Glu 
930 



Trp Leu 
Thr Ala Ala Gly 



Gin Ala He 
935 



Lys Met 
Tyr Asn Ser Leu Glu 



Thr Cys 

910 

Phe Thr Ala Phe Cys Ser Val 
925 

Glu Arg Tyr Lys Asp Asn Phe 

. 940 

Ser Val Ala Arg Met Thr He 
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945 950 955 960 

Asp Asp Val Met Ser Leu Gly lie Thr Leu Val Gly His Gin Lys Lys 

965 970 975 

lie Met Ser Ser lie Gin Thr Met Arg Ala Gin Met Leu His Leu His 
980 985 990 

Gly Thr Gly He Gin Val 
995 



(2) INFORMATION FOR SEQUENCE ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS) 



(A) LENGTH: / i 610/ amino acids ) 

(B) TYPE: (^aTtrtW-a-cicJ " 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp He He Leu Cys Tyr He 
1 5 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp He 
35 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu He Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro He Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp He Ser Lys Gly Asn Ala Gin 

85 90 95 

Arg He Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 120 125 
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Glu Thr Asp 
130 

Lys lie Asp 
145 

Gly Glu Arg 

Leu Ser Lys 

lie Ala Leu 
195 

Val Glu Asn 
210 

Ser Ser Leu 
225 

Glu Ala Glu 
Leu Val Pro 



Gly Asp Thr 
275 

Gin Asp Leu 
290 

Glu Gly Ser 
305 



113 

Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu Tyr Val 
135 140 

Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
150 155 160 

Lys Met Lys Leu Asn Thr Glu Val Arg Glu He Gly Pro 
165 170 175 

Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 " 185 190 

Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr He 

200 205 

Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
215 220 

Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
230 " 235 240 

Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 
245 250 255 

He Gly Lys Cys He Cys Lys Ala Gly Tyr Gin Gin Lys 
260 265 270 

Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 

280 " 285 

Gin Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 
295 * 300 

Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 
310 " 315 320 



Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gin 

325 330 335 



Asn Leu He Phe Asn He Asn Gin Thr Thr Val Ser Leu Glu Trp Ser 
340 345 350 

Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg He Leu 
355 * 360 365 

Cys Lys Arg Cys Ser Trp Glu Gin Gly Glu Cys Val Pro Cys Gly Ser 
370 ~ 375 380 

Asn lie Gly Tyr Met Pro Gin Gin Thr Gly Leu Glu Asp Asn Tyr Val 
385 * 390 395 400 
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Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 

405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 

Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
465 470 475 480 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 

485 ~ 490 495 

Ser Thr Ser Ala Ser lie" Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 

Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 * 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
530 535 540 

Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 
545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 

565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 .585 590 

r — 

Asp Glu Glu Leu Tyr Phe Hiss Ser Leu Val Thr Asn Glu His Leu Ser 
595 " L-600 605 



Val Leu 
61 



L0 



(2) INFORMATION FOR SEQUENCE ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: nucleic 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300 

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350 

CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 

GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450 

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500 

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550 

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600 

TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650 

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700 

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 

ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900 

ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950 

AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000 

CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050 

GGACACTTGC GAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100 

ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150 
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TCATCCAGGT 


GTGAATGTGA 


AGATGGGTAC 


TACAGAGCTC 


CTTCTGATCC 


1200 


ACCATACGTT 


GCATGCACGA 


GGCCTCCCTC 


TGCACCACAG 


AACCTTATTT 


1250 


TCAATATCAA 

a> x^* ** * ^ * * *> x** * 


TCAAACGACT 


GTAAGTTTGG 


AATGGAGTCC 


TCCGGCTGAC 


1300 


AACGGGGGAA 

* *A i X-» X^ X^ Xrf Xrf Xrf* A* ™ 


GAAACGATGT 


CACCTACAGA 


ATACTGTGTA 


AGCGGTGCAG 


1350 


TTGGGAACAG 


GGAGAATGTG 


TGCCATGCGG 


AAGTAACATT 


GGATACATGC 


1400 


CCCAGCAGAC 


GGGATTAGAG 


GATAACTATG 


TCACTGTCAT 


GGACCTACTT 


1450 


GCCCATGCAA 

\J X^ X^ \m*A A * X^ Xl> A JU * 


ATTACACTTT 


CGAAGTTGAA 


GCTGTAAATG 


GAGTTTCGGA 


1500 


CTTAAGCAGA 


TCCCAGAGGC 

^ X^ X^ X^* *\J **>x^ \j x* 


TCTTCGCTGC 


TGTTAGCATC 


ACCACCGGTC 


1550 


AAGCAGCTCC 


CTCGCAAGTG 


AGTGGAGTCA 


TGAAGGAGCG 


AGTACTGCAG 


1600 


CGGAGTGTGC 


AGCTTTCCTG 


GCAGGAGCCG 


GAGCATCCCA 


ATGGAGTCAT 


1650 


CACGGAATAT 


GAAATCAAGT 


ATTATGAGAA 


AGATCAACGG 


GAAAGGACGT 


1700 


ACTCAACACT 


CAAAACCAAG 


TCCACCTCCG 


CCTCCATTAA 


TAATCTGAAA 


1750 


CCGGGAACAG 


TGTACGTCTT 

^ XsJ ^ AJvX^X^ a? x^ a a 


TCAGATCCGG 


GCGGTCACTG 


CTGCCGGTTA 


1800 


TGGAAACTAC 


AGCCCTAGGC 

* AW X^ X^ X^ ^ * *W Xrf X^ 


TTGATGTTGC 


CACACTTGAG 


GAAGCTTCAG 


1850 


GTAAAATGTT 


TGAAGCGACA 


GCAGTCTCCA 


GTGAACAGAA 


TCCTGTCATC 


1900 


ATAATTGCTG 


TAGTGGCTGT 


AGCAGGGACC 


ATCATCTTGG 


TGTTCATGGT 


1950 


GTTCGGCTTC 


ATCATTGGAA 


GAAGGCACTG 


TGGTTATAGC 


AAGGCTGACC 


2000 


AAGAAGGGGA 


TGAAGAACTC 


TACTTTCATT 


CTTTAGTAAC 


AAATGAGCAC 


2050 


CTGTCAGTTT 


TATAAAGCGC 

«k A* *4* * ** ** AX* X^ Xrf X* 


AACAATAACT 


GTTTAAGACA 


ATCAATTTTG 


2100 


GATAAACAAT 


CAACTACAGC 


AGAATAAATC 


AAGATTTTTA 


AGTCCCATTT 


2150 


TCCTTTATAC 

^ X^ X* ^ ^ -A. A A A> A *W 


ATTCTGCTTA 

A A A> A X* A> Xrf X* A> 4* A A 


TTTTGTTGTT 


ATATGTTTAT 


TTTTTAAACT 


2200 


CTGATCTTGA 


TTGAATGTGA 


TACCATAAGC 


ACAGTTAGGC 


TGCAGTGTAA 


2250 


ATATATAAAG 


ACATTGTTCT 

A*X*r A^A A> A- X^ A A> A> 


GAGAGCAGTA 


CGATTTCATG 


GAAAGATTGT 


2300 


TTGGTGGCTT 


TGTTAAAATT 


AATAAAGAAT 


TTTTAAGGAT 


ATAGTGTAAT 


2350 


TTTCTTCATT 


GCATTAATAT 

W X^AA A X «AA A A> * * A? 


AACCAAATAT 


GCCTACCTAT 


CTTTGTCTTG 


2400 


AACCAAATGA 


ATAGATTTGG 


AATACTTTAT 


TGTAATTGAA 


TTTGATATAA 


2450 
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AGTTGACTGA 


GCATTTATGT 

Vp**-* A A> A A> A A A* W A 


GTTACCTGCA 

A A * A X** Ak N**J V^A A 


TGCTTCTGGG 

A A A A 


TGCATTGAAA 


2500 


TATTTTAACT 

A A* * A A A A AA *V A» 


TTTAAAATGA 

A* A> Ai A AA AA AA A A> V«A 


TACTATGTTG 

A «*V« A *A A Vrf A A 


TTTCAATTTT 

AAA V»***^ A A A A 


GACTACCTTT 


2550 
-j \j 


TGTGAGGCAT 

A* A ^^A A^^ V** A A> 


ACTGGCTACC 

A A^* A> A> A AV 


TCCTCCTATT 

A> A X* A A A A> A> 


AGCTAAGATC 


TTCCAAAGCC 


2600 


TTATAATGAA 

A JU AA Jb AAA* A> W * AAA 


AAGTTTATAT 

AAA *NJ A> At A» A A A. A A A> 


AAACCATTTC 

A AA AA *V V^A A «l» A> A» 


T CTTTC AAAT 


CACTGTCATA 


2650 


CTTGGTCACG 

A> A> >J -A. V^A AVW 


GATCCCAGGA 


ATATTGTAAA 

A A A A A A A> A A AA AA A 


TTTTCTAATT 

A> A. A. A> A» A AA A A A> 


TACTCTGCAC 

A **>»«■ A W A w V^**V* 


2700 


TTTGTATATC 

AAA Vrf A ** A * * A N"* 


CAGCCTCTAT 

A *V A A> A A A> 


TACCCTCAAG 


GTGAATATAA 

W A X^«*** A *^ A **** 


AACTATGTCT 

*M»V A ** A W A V« A 


2750 

*> / -J V 


TTTGAATATT 

AAA VJ* A A ^* A A 


TCTCTTTGAT 

A V* A >■» AAA A i A 


TTTGTGATAG 

AAA N-J A w* * A • *vJ 


CAGTCCCTCA 


TATCTTGTAC 

X ** X >-» X X VJ X 


2800 


TAATTTTATG 

A A AA A A <*• A An A A A V»J 


TATATGTCAA 

A ATA A. A A A. A> VAM* 


CAGTGGTTGG 

V^A *W A X* XJ A, Aji \^ 


TCTTTAAAAA 


TAAATCAAAG 

A ****** A V^*l***iw 


2850 


AATAAGTAAA 

A AAA A A AA A\a# A. A AA AA A 


AAAAAAAAAA 


AAAAAAAAAA 

A AA AA AA AA AA AA AA AA AA A 


AAAAATAAAA 

AAA AA AA AA A A> A AA MM A 


AAAAAAAAAA 

A AA7AA AA AA AA AA AA AATAA A 


2900 


A 










2901 



(2) 



INFORMATION FOR SEQUENCE ID NO : 5 
(i) SEQUENCE CHARACTERISTICS'! 

(A) LENGTH: / 

(B) TYPE: ( 

(C) STRANDNESST" 

(D) TOPOLOGY: 



626-"amino acids 
"amino acid 
single 
linear 



(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 



10 



15 



Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro lie Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp lie Ser Lys Gly Asn Ala Gin 
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Arg lie Phe 

Pro Gly Val 
115 

Glu Thr Asp 
130 

Lys lie Asp 
145 

Gly Glu Arg 

Leu Ser Lys 

lie Ala Leu 
195 

Val Glu Asn 
210 

Ser Ser Leu 
225 

Glu Ala Glu 

Leu Val Pro 

Gly Asp Thr 
275 

Gin Asp Leu 
290 

Glu Gly Ser 
305 

Ser Asp Pro 
Asn Leu lie 



85 

Val Glu Leu 
100 

Leu Gly Thr 

Tyr Asp Thr 

Thr He Ala 
150 

Lys Met Lys 
165 

Lys Gly Phe 
180 

Val Ser Val 

Leu Ala Val 

Val Glu Val 
230 

Asn Ser Pro 
245 

lie Gly Lys 
260 

Cys Glu Pro 
Gin Cys Ser 



Ser Arg Cys 
310 

Pro Tyr Val 
325 

Phe Asn He 
340 



90 

Lys Phe Thr Leu Arg Asp 
105 

Cys Lys Glu Thr Phe Asn 
120 

Gly Arg Asn He Arg Glu 
135 140 

Ala Asp Glu Ser Phe Thr 

155 

Leu Asn Thr Glu Val Arg 
170 

Tyr Leu Ala Phe Gin Asp 
185 

Lys Val Tyr Tyr Lys Lys 
200 

Phe Pro Asp Thr Val Thr 
215 220 

Arg Gly Thr Cys Val Ser 

235 

Arg Met His Cys Ser Ala 
250 

Cys He Cys Lys Ala Gly 
265 

Cys Gly Arg Arg Phe Tyr 
280 

Arg Cys Pro Thr His Ser 
295 300 

Glu Cys Glu Asp Gly Tyr 

315 

Ala Cys Thr Arg Pro Pro 
330 

Asn Gin Thr Thr Val Ser 
345 



95 

Cys Asn Ser Leu 
110 

Leu Tyr Tyr Tyr 
125 

Asn Leu Tyr Val 



Gin Gly Asp Leu 
160 

Glu He Gly Pro 
175 

Val Gly Ala Cys 
190 

Cys Trp Thr He 
205 

Gly Ser Glu Phe 



Ser Ala Glu Glu 
240 

Glu Gly Glu Trp 
255 

Tyr Gin Gin Lys 
270 

Lys Ser Ser Ser 
285 

Phe Ser Asp Arg 



Tyr Arg Ala Pro 
320 

Ser Ala Pro Gin 
335 

Leu Glu Trp Ser 
350 



Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg He Leu 
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355 360 

Cys Lys Arg Cys Ser Trp Glu Gin 
370 ~ * 375 

Asn lie Gly Tyr Met Pro Gin Gin 
385 390 



365 

Gly Glu Cys Val Pro Cys Gly Ser 
380 

Thr Gly Leu Glu Asp Asn Tyr Val 
395 400 



Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 

405 410 415 

Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gin Arg Leu Phe Ala 
420 425 430 

Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 ~ 455 460 

Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu He Lys Tyr 
465 470 475 480 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 

485 490 495 

Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 

Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 " 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
530 535 540 

Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 
545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 

565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Ser Leu Tyr Arg Glu Arg Gly Asp Gly 
595 ^600 605 

Met Glu Lys Thr Gin His Asn Lys Lys Trp Met He Ala Ser Cys Ser 
610 615 620 



SSSD/27372. vOl 



120 



Arg Leu 

625 J 



(2) INFORMATION FOR SEQUENCE ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 

CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 

GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 

TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200 

CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 

TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300 

CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350 

CACAACAAAC AGAATTGGAA ^TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 

GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450 

GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500 

TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550 

AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600 

TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650 

AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700 

CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 

GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 

GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 
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ACCATTGTTG 


AGAACTTAGC 


TGTCTTTCCA 


GATACAGTGA 


CTGGTTCGGA 


900 


ATTTTCCTCC 


TTAGTCGAGG 


TCCGTGGGAC 


ATGTGTCAGC 


AGTGCCGAGG 


950 


AAGAGGCAGA 


AAATTCCCCC 


AGAATGCATT 


GCAGTGCAGA 


AGGAGAGTGG 


1000 


CTAGTACCCA 


TTGGAAAATG 


CATCTGCAAA 


GCAGGCTATC 


AGCAAAAAGG 


1050 


GGACACTTGC 


GAACCCTGTG 


GCCGCAGGTT 


CTACAAATCT 


TCCTCTCAGG 


1100 


ATCTCCAGTG 


TTCTCGTTGT 


CCAACCCACA 


GCTTCTCTGA 


CCGAGAAGGA 


1150 


TCATCCAGGT 


GTGAATGTGA 


AGATGGGTAC 


TACAGAGCTC 


CTTCTGATCC 


1200 


ACCATACGTT 


GCATGCACGA 


GGCCTCCCTC 


TGCACCACAG 


AACCTTATTT 


1250 


TCAATATCAA 


TCAAACGACT 


GTAAGTTTGG 


AATGGAGTCC 


TCCGGCTGAC 


1300 


AACGGGGGAA 


GAAACGATGT 


CACCTACAGA 


ATACTGTGTA 


AGCGGTGCAG 


1350 


TTGGGAACAG 


GGAGAATGTG 


TGCCATGCGG 


AAGTAACATT 


GGATACATGC 


1400 


CCCAGCAGAC 


GGGATTAGAG 


GATAACTATG 


TCACTGTCAT 


GGACCTACTT 


1450 


GCCCATGCAA 


ATTACACTTT 


CGAAGTTGAA 


GCTGTAAATG 


GAGTTTCGGA 


1500 


CTTAAGCAGA 


TCCCAGAGGC 


TCTTCGCTGC 


TGTTAGCATC 


ACCACCGGTC 


1550 


AAGCAGCTCC 


CTCGCAAGTG 


AGTGGAGTCA 


TGAAGGAGCG 


AGTACTGCAG 


1600 


CGGAGTGTGC 


AGCTTTCCTG 


GCAGGAGCCG 


GAGCATCCCA 


ATGGAGTCAT 


1650 


CACGGAATAT 


GAAATCAAGT 


ATTATGAGAA 


AGATCAACGG 


GAAAGGACGT 


1700 


ACTCAACACT 


CAAAACCAAG 


TCCACCTCCG 


CCTCCATTAA 


TAATCTGAAA 


1750 


CCGGGAAGAG 


TGTACGTCTT 


TCAGATCCGG 


GCGGTCACTG 


CTGCCGGTTA 


1800 


TGGAAACTAC 


AGCCCTAGGC 


TTGATGTTGC 


CACACTTGAG 


GAAGCTTCAG 


1850. 


GTAAAATGTT 


TGAAGCGACA 


GCAGTCTCCA 


GTGAACAGAA 


TCCTGTCATC 


1900 


ATAATTGCTG 


TAGTGGCTGT 


AGCAGGGACC 


ATCATCTTGG 


TGTTCATGGT 


1950 


GTTCGGCTTC 


ATCATTGGAA 


GAAGGCACTG 


TGGTTATAGC 


AAGGCTGACC 


2000 


AAGAAGGGGA 


TGAAGAACTC 


TACTTTCATT 


CTCTTTACAG 


GGAAAGGGGA 


2050 


GACGGGATGG 


AAAAGACACA 


GCACAATAAG 


AAGTGGATGA 


TTGCATCGTG 


2100 


CTCTCGTTTG 


TAGGTCTCTT 


TTCCTAATCA 


ACACTATGAT 


TTTGAAGTAC 


2150 
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GCGTACACGA AGCAAACGGG AAGAGATAAG 
ACTGTAATCC TCTCTTCCGG AAAGAGATGA 
GTACAACTTG CACCTTGAAA TCTTTTTTGA 
GGGGGGGAAG TAGAGAAAGC AAA 




122 

GAATTAGCAT TGTGAACCTG 2200 
GATGCTATTG CGATGAGAAT 2250 
TAATTAGTGC TCAGGGGAGG 2300 

2323 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 



6 amino acids 
amino acid 
single 
linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

Ala Ala Thr Ala Ala Ala 

5 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

■ (B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(xi) SEQUENCE DESCRIPTION: 

Ala Ala Thr Ala Ala Ala 

5 



6 amino acids 
amino acid 
single 
linear 

peptide 

SEQ ID NO: 8: 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 



SSSD/27372. vOl 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 



123 

6 amino acids 
amino acid 
single 
linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

His Arg Asp Leu Ala Ala 

5 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 



6 amino acids 
amino acid 
single 
linear 



(ii) MOLECULE TYPE: peptide 
(ix)' FEATURE: 

(D) OTHER INFORMATION: 



Xaa in position 2 is valine 
or methionine; Xaa in 
position 5 is phenylalanine 
or tyrosine. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Asp Xaa Trp Ser Xaa Gly 

5 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 



993 amino acids 
amino acid 
single 
linear 



(ii) MOLECULE TYPE: 



peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
1 5 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 " 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro lie Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 * 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp lie Ser Lys Gly Asn Ala Gin 

85 90 95 

Arg lie Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 ~ * 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu Tyr Val 
130 135 140 

Lys lie Asp Thr lie Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu lie Gly Pro 

165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 * * 185 190 

lie Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr lie 
195 200 205 

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
.225 230 ~ 235 240 

Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 

245 250 255 

Leu Val Pro lie Gly Lys Cys lie Cys Lys Ala Gly Tyr Gin Gin Lys 
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Gly Asp Thr 
275 

Gin Asp Leu 
290 

Glu Gly Ser 
305 

Ser Asp Pro 
Asn Leu lie 



Pro Pro Ala 
355 

Cys Lys Arg 
370 

Asn lie Gly 
385 

Thr Val Met 



Ala Val Asn 



260 

Cys Glu Pro Cys 



Gin Cys 

Ser Arg 

Pro Tyr 
325 

Phe Asn 
340 

Asp Asn 
Cys Ser 
Tyr Met 



Asp Leu 
405 

Gly Val 
420 



Ser Arg 
295 

Cys Glu 
310 

Val Ala 

lie Asn 

Gly Gly 

Trp Glu 
375 

Pro Gin 
390 

Leu Ala 
Ser Asp 



265 

Gly Arg Arg 
280 

Cys Pro Thr 

Cys Glu Asp 

Cys Thr Arg 
330 

Gin Thr Thr 
345 

Arg Asn Asp 
360 

Gin Gly Glu 

Gin Thr Gly 

His Ala Asn 
410 

Leu Ser Arg 
425 



Phe Tyr Lys 
285 

His Ser Phe 
300 

Gly Tyr Tyr 
315 

Pro Pro Ser 

Val Ser Leu 

Val Thr Tyr 
365 

Cys Val Pro 
380 

Leu Glu Asp 
395 

Tyr Thr Phe 
Ser Gin Arg 



270 

Ser Ser Ser 

Ser Asp Arg 

Arg Ala Pro 
320 

Ala Pro Gin 
335 

Glu Trp Ser 
350 

Arg lie Leu 
Cys Gly Ser 



Asn Tyr Val 
400 

Glu Val Glu 
415. 

Leu Phe Ala 
430 



Ala Val Ser He Thr Thr Gly Gin Ala Ala Pro Ser Gin Val Ser Gly 
435 440 445 

Val Met Lys Glu Arg Val Leu Gin Arg Ser Val Gin Leu Ser Trp Gin 
450 " ~ 455 460 



Glu Pro Glu His Pro Asn Gly Val He Thr Glu Tyr Glu 
465 470 * 475 

Tyr Glu Lys Asp Gin Arg Glu Arg Thr Tyr Ser Thr Leu 

485 490 

Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr 
500 505 



He Lys Tyr 
480 

Lys Thr Lys 
495 

Val Tyr Val 
510 



Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 
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Arg Leu Asp Val Ala Thr Leu Glu Glu 
530 535 

Ser Glu Gin Asn Pro Val lie lie lie 
545 550 

Thr He He Leu Val Phe Met Val Phe 

565 

His Cys Gly Tyr Ser Lys Ala Asp Gin 
580 585 

Phe His Phe Lys Phe Pro Gly Thr Lys 
595 600 

Tyr Glu Asp Pro Asn Arg Ala Val His 
610 615 

Ala Ser Cys He Lys He Glu Arg Val 
625 630 

Glu Val Cys Ser Gly Arg Leu Lys Leu 

645 

Val Ala He Lys Thr Leu Lys Val Gly 

660 665 

Asp Phe Leu Cys Glu Ala Ser He Met 
675 680 

Val Val His Leu Glu Gly Val Val Thr 
690 695 

Val He Glu Phe Met Glu Asn Gly Ala 
705 710 



Ala Ser Ala Thr Ala Val Ser 
540 

Ala Val Val Ala Val Ala Gly 
555 560 

Gly Phe He He Gly Arg Arg 
570 575 

Glu Gly Asp Glu Glu Leu Tyr 

590 

Thr Tyr He Asp Pro Glu Thr 
605 

Gin Phe Ala Lys Glu Leu Asp 
620 

He Gly Ala Gly Glu Phe Gly 
635 640 

Pro Gly Gin Arg Asp Val Ala 
650 655 

Tyr Thr Glu Lys Gin Arg Arg 

670 

Gly Gin Phe Asp His Pro Asn 
685 

Arg Gly Lys Pro Val Met He 
700 

Leu Asp Ala Phe Leu Arg Lys 

. 715 " 720 



His Asp Gly Gin Phe Thr Val He Gin Leu Val Gly Met Leu Arg Gly 

725 730 735 

He Ala Ala Gly Met Arg Tyr Leu Ala Asp Met Gly Tyr Val His Arg 
740 745 750 

Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser Asn Leu Val Cys Lys 
755 760 765 

Val Ser Asp Phe Gly Leu Ser Arg Val He Glu Asp Asp Pro Glu Ala 
770 " 775 780 

Val Tyr Thr Thr Thr Gly Gly Lys He Pro Val Arg Trp Thr Ala Pro 
785 790 795 800 
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Glu Ala He Gin Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp Ser 

805 810 815. 

Tyr Gly He Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg Pro Tyr 
820 " 825 830 

Trp Asp Met Ser Asn Gin Asp Val He Lys Ala He Glu Glu Gly Tyr 
835 840 845 

Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly Leu His Gin Leu Met 
850 855 860 

Leu Asp Cys Trp Gin Lys Asp Arg Ala Glu Arg Pro Lys Phe Glu Gin 
865 870 875 880 

He Val Gly He Leu Asp Lys Met He Arg Asn Pro Ser Ser Leu Lys 

885 890 895 

Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser Pro Leu Leu Asp Gin 
900 * ~ 905 910 

Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val Gly Glu Trp Leu Gin 
915 920 925 

Ala He Lys Met Glu Arg Tyr Lys Asp Asn Phe Thr Ala Ala Gly Tyr 
930 935 940 

Asn Ser Leu Glu Ser Val Ala Arg Met Thr He Asp Asp Val Met Ser 
945 950 955 960 

Leu Gly He Thr Leu Val Gly His Gin Lys Lys He Met Ser Ser He 

965 * 970 975 

Gin Thr Met Arg Ala Gin Met Leu His Leu His Gly Thr Gly He Gin 
980 985 990 

Val 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 994 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Val Val Gin Thr Arg Phe Pro Ser Trp lie lie Leu Cys Tyr lie 
1 5 10 15 

Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gin Ala Ala Lys Glu 
20 25 30 

Val Leu Leu Leu Asp Ser Lys Ala Gin Gin Thr Glu Leu Glu Trp lie 
35 40 45 

Ser Ser Pro Pro Ser Gly Trp Glu Glu lie Ser Gly Leu Asp Glu Asn 
50 55 60 

Tyr Thr Pro lie Arg Thr Tyr Gin Val Cys Gin Val Met Glu Pro Asn 
65 70 * 75 80 

Gin Asn Asn Trp Leu Arg Thr Asn Trp lie Ser Lys Gly Asn Ala Gin 

85 90 95 

Arg lie Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 
100 105 110 

Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 
115 * 120 125 

Glu Thr Asp Tyr Asp Thr Gly Arg Asn lie Arg Glu Asn Leu Tyr Val 
130 135 140 

Lys He Asp Thr He Ala Ala Asp Glu Ser Phe Thr Gin Gly Asp Leu 
145 . 150 155 160 

Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu lie Gly Pro 

165 170 175 

Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys 
180 185 . 190 

He Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr He 
195 200 205 

Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 
210 215 220 

Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 
225 230 235 240 
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Glu Ala Glu Asn 
Leu Val 
Gly Asp 



Pro lie 
260 



Gin Asp 
290 

Glu Gly 
305 



Thr Cys 
275 

Leu Gin 



Ser Ser 



Ser Asp Pro Pro 
Asn Leu 



Pro Pro 

Cys Lys 
370 

Asn lie 
385 

Thr Val 
Ala Val 
Ala Val 



He Phe 
340 

Ala Asp 
355 

Arg Cys 
Gly Tyr 
Met Asp 



Asn Gly 
420 

Ser He 
435 



Val Met 
450 



Lys Glu 
Glu His 
Tyr Glu Lys Asp 



Glu Pro 
465 



Ser Pro 
245 

Gly Lys 

Glu Pro 

Cys Ser 

Arg Cys 
310 

Tyr Val 
325 

Asn He 

Asn Gly 

Ser Trp 

Met Pro 
390 

Leu Leu 
405 

Val Ser 
Thr Thr 
Arg Val 



Pro Asn 
470 

Gin Arg 
485 



Arg Met His Cys 
250 

Cys He Cys Lys 
265 

Cys Gly Arg Arg 
280 

Arg Cys Pro Thr 
295 

Glu Cys Glu Asp 



Ala Cys Thr Arg 
330 

Asn Gin Thr Thr 
345 

Gly Arg Asn Asp 
360 

Glu Gin Gly Glu 
375 

Gin Gin Thr Gly 



Ala His Ala Asn 
410 

Asp Leu Ser Arg 
425 

Gly Gin Ala Ala 
440 

Leu Gin Arg Ser 
455 

Gly Val He Thr 



Glu Arg Thr Tyr 
490 



Ser Ala Glu Gly Glu Trp 

255 

Ala Gly Tyr Gin Gin Lys 
270 

Phe Tyr Lys Ser Ser Ser 
285 

His Ser Phe Ser Asp Arg 
300 

Gly Tyr Tyr Arg Ala Pro 
315 * " 320 

Pro Pro Ser Ala Pro Gin 

335 

Val Ser Leu Glu Trp Ser 
350 

Val Thr Tyr Arg He Leu 
365 

Cys Val Pro Cys Gly Ser 
380 

Leu Glu Asp Asn Tyr Val 
395 400 

Tyr Thr Phe Glu Val Glu 

415 

Ser Gin Arg Leu Phe Ala 
430 

Pro Ser Gin Val Ser Gly 
445 



Val Gin 
460 

Glu Tyr 
475 



Leu Ser Trp Gin 



Glu He Lys Tyr 
480 



Ser Thr Leu Lys Thr Lys 

495 



Ser Thr Ser Ala Ser He Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 
500 505 510 
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Phe Gin He Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 
515 520 525 

Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 
530 535 540 

Ala Thr Ala Val Ser Ser Glu Gin Asn Pro Val He He He Ala Val 
545 550 555 560 

Val Ala Val Ala Gly Thr He He Leu Val Phe Met Val Phe Gly Phe 

565 570 575 

He He Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gin Glu Gly 
580 585 590 

Asp Glu Glu Leu Tyr Phe His Cys Thr Lys Thr Tyr He Asp Pro Glu 
595 600 605 

Thr Tyr Glu Asp Pro Asn Arg Ala Val His Gin Phe Ala Lys Glu Leu 
610 " 615 620 

Asp Ala Ser Cys He Lys He Glu Arg Val He Gly Ala Gly Glu Phe 
625 ~ 630 635 640 

Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Gin Arg Asp Val 

645 * ~ 650 655 

Ala Val Ala He Lys Thr Leu Lys Val Gly Tyr Thr Glu Lys Gin Arg 
660 665 670 

Arg Asp Phe Leu Cys Glu Ala Ser He Met Gly Gin Phe Asp His Pro 
675 680 685 

Asn Val Val His Leu Glu Gly Val Val Thr Arg Gly Lys Pro Val Met 
690 695 700 

He Val He Glu Phe Met Glu Asn Gly Ala Leu Asp Ala Phe Leu Arg 
705 710 715 720 

Lys His Asp Gly Gin Phe Thr Val He Gin Leu Val Gly Met Leu Arg 

725 730 735 

Gly He Ala Ala Gly Met Arg Tyr Leu Ala Asp Met Gly Tyr Val His 
740 * 745 . 750 

Arg Asp Leu Ala Ala Arg Asn He Leu Val Asn Ser Asn Leu Val Cys 
755 760 765 

Lys Val Ser Asp Phe Gly Leu Ser Arg Val He Glu Asp Asp Pro Glu 
770 ' 775 780 
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Ala Val Tyr Thr Thr Thr Gly Gly Lys lie Pro Val Arg Trp Thr Ala 
785 790 * 795 ~ 800 

Pro Glu Ala lie Gin Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp 

805 810 815 

Ser Tyr Gly lie Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg Pro 
820 825 ' * 830 

Tyr Trp Asp Met Ser Asn Gin Asp Val lie Lys Ala lie Glu Glu Gly 
835 840 845 

Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly Leu His Gin Leu 
850 855 860 

Met Leu Asp Cys Trp Gin Lys Asp Arg Ala Glu Arg Pro Lys Phe Glu 
865 870 875 " 880 

Gin He Val Gly He Leu Asp Lys Met He Arg Asn Pro Ser Ser Leu 

885 890 895 

Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser Pro Leu Leu Asp 
900 " 905 910 

Gin Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val Gly Glu Trp Leu 
915 * 920 925 

Gin Ala He Lys Met Glu Arg Tyr Lys Asp Asn Phe Thr Ala Ala Gly 
930 935 940 

Tyr Asn Ser Leu Glu Ser Val Ala Arg Met Thr He Asp Asp Val Met 
945 950 ~ 955 960 

Ser Leu Gly He Thr Leu Val Gly His Gin Lys Lys He Met Ser Ser 

965 970 975 

He Gin Thr Met Arg Ala Gin Met Leu His Leu His Gly Thr Gly He 
980 985 990 

Gin Val 
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